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WHy (EST INTELLIGIBILITY” 


Barbara F. Grimes 


“If you ın a tongue utter speech that is mot intelligible, 
how will anyone know what is said? For you will be speaking into 
the air. There are doubtless many different languages ir the 
world, and none is without meaning; but if 1 do not know the 
meaning of tne language, I shall be a foreigner to the speaker 
and the speaker a foreigner ta mi 


Tne Apostle Paul argued thus with the Corinthian Christians 
in the First Century A. D. (1 Cor. 1419-11) because meaningful 
Ctwnunication was basic for him (B, F. Grimes ms.). And so it 
is, or should be, for governments interested in communicating 
with their citizens; for educators preparing materials, training 
teachers, and teaching students; for community develooment agents 
trying te brang about improvements in agriculture, health, or 
literacy; for counsellors dealing with family or other soctal 
problems; for linguists, antnropelegists, and other scholars 
engaging in research; and for missionaries and Bible translators 
trying to communicate life-changing concepts. 


Adequate communication in a multilingual setting involves a 
responsible language choice by the communicator. Decisions about 
which language to use, however, are often made on the basis of 
what is easiest for the communicator, rather than on what 
communicates most effectively with the hearers, The communicator 
probably already knows, or has to learn, the national language or 
lingua franca, and literature may already exist in that language. 
It may seem an unnecessary burden to have te learn the vernacular 
speech of his audience as well. 


For effective communication, however, language decisions 
need to be based on accurate evaluations of how well people 
understand their second language or other dialects of their first 
language, and on attitudes in the various segmenta of society 
about the acceptability of using other languages or dialects they 
may understand. 


Suggestions about how to evaluate degrees of second language 
proficiency in a community nave been given by B. F. Grimes 
{1986a) and Quakenbush (1986), Methods for evaluating language 
attitudes have been described by Agheyisi and Fishman (1970). 
The relative importance of; comprehension versus language 
attitudes has been discussed by{Grimes (1982, 1985a), How to 
test intelligibility among related dialects and languages ham 
been described in detail by Casad (1974), 


This paper attempts to show that other methods for getting 
at dialect intelligibility which have the apparent advantage of 
being faster than intelligibility testing, and which may have 
validity for some other purpo are inadequate for evaluating 


Mralect intelligibility, Because of the indirectm of thm 
approaches te intelligibility, the results from those metnogs 
stall emo to be verified for each dialect oair separately by the 
more direct method of intelligibility testing am descrioed by 
Fasad. 


Published Lists of Languages ara Dialects 


Is it not possible to get an accurate picture of what are 
separate languages and what are dialects of those languages by 
using lists of languages and dialects that are already avai ladle? 
Unfortunately, such lists often contain inaccuracy 


‘Dialect’ versus ‘Language’. The use of the term ‘dialect’ may 
mot indicate that the form of speech 1s inherently intelligible 
mith another form of speech, but rather may be an example of 
popular use, which attributes dialect status to anything that is 
net a major language, Similarly, the term ‘language’ may be used 
ir a Popular sense only for languages with official status which 
have many speakers. 


Ethnic Nane: Names of ethnic or people groups or tribi oo not 
always correlate on a one-to-one basis with names of languages. 
An ethnic name may include several languages, some of which may 
net even be related, or closely related, to each other. The 


Yugur official ethnie minority im China, also called Yellow 
Uighur, is made up of two distantly related languages (Jan-Olof 
Svantesson, personal communication): West Yugur, whicn is a 
Northern Turkic language (Voegelin and Voepelin 1977.340), and 
East Yugur, which is a Mongolian language (Poppe 1970). Botn are 
distinct from Uighur, which is Eastern Turkic. Similarly, the 
Slack Bobo (Bobo Fang) and Red Bobo (Bwamu and Boomu) of Burkina 
Faso and Mali are in the Mande and Gur families of Niger-Congo, 
respectively. These and other similar cases are sonetines merged 
or confused in lists. 


On the other hand, distinct ethnic names may identify groups 
whose speech is inherently intelligible to each other. and nence 
are linguistically dialects, but which are listed as separate 
‘languages’ because ef politica) and other cultural differences, 
such as the Sotho, Pedi, and Tswana of southern Africa, or the 
Zulu, Swazi, and Xhosa, also ef southern Africa. 


Even for groups which do have a one-to-one corretation 
between people names and languages, there may be distinct names 
for the people and their language, for example, for Mamprusi and 
Mampruli, Kusasi and Kusaal, Kasena and Kasem, Dagbampa or 
Dagomba and Dagbani, Dagaaba and Dapaarı, all of Ghana. Some 
sources list such pairs as two languages, as for Tamprusi ana 
Tampulma of Ghana (Voegelin and Voegelin 1977. 151). 


Traditional Names. Historic nai may continue to be used for 
Groups, with the separate varieties being referred to as 
‘dialects’, even though the traditional name may include several 
inherently unintelligible forms of speech. This is true of the 
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On the other hand. language names may be used in a 
qeographic sense: "Butor? may be applied outside of Southeast 
Sulawesi, Indonesia, to anyone from that region, and 'Javanese* 
way be used outside of Java for anyone from that island, whether 
they speak Butonese or Javanese or some other language. 


Village Names. Early lists of languages im Papua New Guinea and 
Irian Jaya contained some village names which did rot necessarily 
represent separate dialects or languages. 


Alternate Names. Lists may include variant mames for the sare 
language as though they were separate languages, such as Aucaar 
and Djuka of Surinam (Voegelin and Voegelin 1977.143). Early 
explorers in Papua New Guinea found some language groups that did 
not have names, so they arbitrarily assigned ts them their ovr 
word for ‘house’ Other explorers assigned to the same languages 
their word for ‘man’. In other places, a word by which dialects 
differ nas been assigned to each as its name. Some groups have 
been given derogatory names (Chontal, Araucanian, Auca, Jivaro), 
which in some cases have become the official name for them. 
Nearby groups often use different names for a given neighbor, 
while they use still a different one for themselves. Other 
groups are distinguished by the color of the women's blouses or 
the coler of headgear, such as the White Tai, Red Tai, Black 
Karen, Striped Karen, Blue Meo, Green Meo, Flowery Miao, ever 
though such distinctions may or may not coincide with language 
differences. 


Census Lists. Fasold (1984.23) describes some of the problems 
with language names found in census information. “For some 
people, it may be the first time they have even had to think 
about what language they speak. The answer is likely to be based 
more on the desire to be associated with a prestige language, to 
appear patriotic, or to claim allegiance to a local ethnic 
group... some people have a very local self-concept. Not knowing a 
namg for their language, and unwilling for one reason or other to 
nare ong of the major languages, they give the name of their 
caste, locality, or even their occupations as the name of thei 
language. It is now up to the census officials to decide how to 
count such a response.“ 


Extinet Languages. Lists may say that certain languages are 


extinct which are actually extant, such as Urarina of Peru, 
Chamicuro of Peru, Apiaca of Brazil, Culina of Brazil and Peru, 
Saliba of Colombia and Venezuela (Voegelin and Voegelin 1977.22, 
24, 95, 282, 132), Others may be listed as nearly extinct which 
are still very much alive, such as Sea Island Creole (Gullah; 
Voegelin and Voegelin 1977. 143). 


Language Groups. Linguists sometimes use the term ‘language’ in 
a larger sense than a group of inherently intelligible dialects, 
for what is really a group of related languages. This usage may 
be based on linguistic similarities, ethnic relationships, 
language standardization, tradition, or a combination of these. 
Some linguists refer to Guechua of South America as a single 
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language (Voegelin and Voegelin 1977.21). Other exanoles of 
usage are Berber (Gunnemark and Kenrick 1995.11, 36, 38. 40, 
55), and Maya (Gunnemark and Kenrick 1985,39. 


herent Int. ibility. Linguists may treat anm ceqree of 
inherent intelligibility as indicating that variants ave dialects 
of a Single language. Voegelin and voegelir (1977.357. say 
regarding Zapoteco of Mexico, “Longacre (19364) s sever 
different languages, among some of which (according to se Pr gule 


and Freeland (1933)) there exists mutual intelligibility, and 
hence different dialects rather thar differert langusges. As 
already mentioned, lack of adequate intelligibility distingu: snes 
55 Zapoteco languages, rather than marginal ce lis.ted 
intelligibility uniting them. 

It seems clear from these examples that it is ru poss _sle 


to use lists of ‘languages! and ‘dialects’ te incicate -~-ere 
there is adequate intelligibility and whore there is net. I5 is 
necessary to do further investigation in order to uncerstanc the 
real relationships. 


Comparing Woro Lists 


Probably the most frequently used method for establis-ing 
dialect closeness in survey situations is through comta-ing nord 
lists. Swadesh (t950, 1952, 1953, 1955) proposed lists of 100 
and 200 words that he said were those words least likely tc -ave 
changed through time. He was primarily trying to dete-mine tne 
time depth since larguages diverged, believing that linguistic 
change is uniform through time from one language to an 
Most linguists have rejected the idea that there is sich a 
regular relationship of time to linguistic change (ucsen:~sky 
1955, 1986a, 1956b, Hoijer 1956, Rea 1958, among others). 
However, his lists and modifications of them have beer usec in 
many parts of the world for dialect and language compe-:sons. 


Swadesh suggested that the dividing line between c:alacts of 
one language and separate languages was 81% lexical sititasity, 
based on averaging what was known about some languages though 
comparative linguistics and through archeology. He suggestec 
that comparison of probable cognates by inspecting werc lists of 
dialects and languages could give a resulting pe 
similarity, called ‘lexicostatistics', that would accronimate a 
figure for true cognates, arrived at through tne moarative 
method, and would be a faster way to determine lars sage 
relationships than the comparative method. 


Comparative studies have been undertaken mainly t= “inc the 
closeness of phonological systems, and to unde-stana the 
phonological processes which take place as languases ects 
diverge. Swadesh invented lewicostatistics te try the 
time depth since divergence. Comparing word lists 
percentage of lexical similarity and therefere to aroroy: mate 
intelligibility between dialects is a still different use. 
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personal communication) and Mayan languages of Central America 


(S. Echerd, personal communication), where lexical similarity 
percentages are high, yet prammatical and discourse differences 
hinder communication, Such hindrances in Quechua and Malay 


dialects have been described by speakers as, “I understand all 
the words, but I don't have the vaguest idea what they are 
saying.” 


Although Spanish and Portuguese have @9% lexical similarity 
(Rea 1958), intelligibility is hindered at least partially by 
complex morphophonemic changes in Portuguese which make 
Portuguese difficult for Spanish speakers to understand. Words 
that are true cognates can be unrecognizable to a Spanish speaker 
who has not had contact with Portuguese; Portuguese speakers have 
less difficulty with Spanish. 


A Word List as ar Adequate Sample of the Lexicon. There is also 
a question as to whether or not a given word list of 109 or 200 
words is an adequate sample of the total lexicon of a language 
for purposes of comparing that lexicon with the lexicon of 
another language (Gudschinsky 19562). If the Swadesh list truly 
reprasents words that are less likely to change through time, as 
he claimed, then the total lexicon of two languages would be less 
similar than the lexicostatistics indicate, The lists are not 
randomly selected, but carefully screened. If the lists are 
modified very much to eliminate culturally or geographically 
irrelevant terms, then the resulting percentages are joss 
meaningful when compared with the results from other lists which 
have not been modified, or have been modified in other ways. 
This is tllustratead in an extreme way by the following carefully 
selected lists, which could seem to indicate that American 
English and British English are not related: 


American English itish Enqlisn 
1. flashlight torch 
2. pitcher ug 
3. windshield wind screen 
4, car trunk boot 
5. car hood bonnet 
6. station wagon shooting brake, 
estate wagon 
7. truck lorry 
a. thumb tack drawing pin 
9. cookie biscuit 
10. biscuit scone 
11. French fries chips 
12. supper tea 
13. bedtime snack supper 
14. coffee break elevenses 
15. elevator lift 
P maj iste. The same informant can give 


differing word lists at different times. The area of meaning 
covered by apparent equivalents in two languages is often not 
identical: there is partial overlap and some difference in 
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meaning. The informant may pive one synonym one time, and 
another at a different time, with differing overlap in meaning. 
One time he may give a specific equivalent, another time a 
generic equivalent. One time he may give a loar: word, another 
time a vernacular word. One time a regionalism, another time a 
more universally known word. Different elicitors may evoke 
different responses from an informant because of the elicitors’ 
ethnic or linguistic affiliation, personality traits, or cultural 
status. Different settings for the taking of lists, such as 
school or home, may evoke more formal or more informal responses. 
Whether or not other people are present and who they are can 
bring about different responses. Different moods of the elicitor 
or responder may influence different responses. 


Diff: ists. Different inforeants from 
the same dialect can give lists that differ somewhat because they 
speak slightly different idiolects, or for any of the above 
mantioned reasons. 


Language of Elicitation. The language of elicitation may cause 
different responses. It may elicit mora borrowings fros itself 
than would ordinarily be used. If the language of elicitation is 
related to the language being investigated, it may elicit more 
cognates than would ordinarily be used, or cognates that have 
shifted meaning, but which are given because of the obvious “orm 
relationship, even though they are not a close meaningful 
equivalent. Different languages of elicitation used for 
collecting lists from different informants in the same dialect or 
from different dialects may cause different responses. 


t n Finding a Single Term. Hoijer (1956) pointed cut 
that a single term in the language of elicitation may not have a 
single equivalent in the dialect being investigated. In that 
case, an informant may not be able to give a response, or may 
arbitrarily choose one of several possible partial equivalents. 


ja for Several Items. Barbara D. Grimes (personal 
Communication) reporta that in Indonesia there was a prolem with 
the same term being used for several body parts, such as 'paim of 
hand? end "sole of foot', or for ‘red’ and "blood’, or 'water' 
and ‘river’, whereas these items are distinguished in the 
language of elicitation. 


Misunderstandings. Jack Keele (personal communication) fourc 
after he learned the Guayabero language of Colombia that an 
earlier list in Guayabero taken by someone who did not know tne 
language differed by 25% from his list. In one instance, the 
investigator had asked for the word for amarillo’ (yellow), and 
the informant had responded with the word for 'armadillot. In 
another instance, the investigator had asked for the word for 
‘man’, but because the language has separate words for 'humar 
being’, *mal and *Indian man', the informant chose a different 
one from that chosen in another dialect. Lists taken by 
investigators who know at 1 t one of the dialects are sore 


© 


Why Test Intelligibility? F. Grimes 13 


Pidpins and Creoles. Lexical similarity percentages can be 
misleading for pidgins and creoles, which may have high 
similarity percentages with each other or with their acrolact, 
but which are based on grammatical structures very different from 
their acrolect. It is necessary to do intelligibility testing in 
order to know how well they understand other creoles, or the 
acrolect, or standard language. 


Carolyn Miller (personal communication)? has reported 
situations in Sabah, Malaysia, where certain dialects appear to 
be Dusunic in lexicon but Murutic in grammar, Other mixed 
Situations are reported for the coastal plantations of Guatemala. 
The could be pidyins or craoles which have not yet been 
identified as such. Trying to determine intelligibility for them 
by comparing lexical similarity alone could be misleading. 


Hall (1966.124) pointe out, “for most languages of the 
arth, we do not know their previous history, and have no way of 
knowing whether or not--especially in tha case of apparently 
distantly related languages--a stage of pidginization and 
creolization may have entered inte the process of 
differentiation, This would seem especially likely in certain 
repions, such as Melanesia, where it is auspacted that 
pidginization may ba endemic as a linguistic behavior pattern; 
but who can tell whether similar processes may not have taken 
place elsewhere, in places and times no longer accessible to use 
by historical record?” 


Geographical Extent of the Language. Intelligibility testing 
makes the investigator aware of how far intelligible dialects 
extend, so he can determine adjustments in orthography, 
vocabulary, or grammar that should be made in literature for all 
those dialects to use (C. Walton, personal communication). 


Dialect Chajne. Dialect chains are made up of a series of 
dialects each of which has some intelligibility with those next 
to it, but which may not be able to understand those farther 
away, such as Arabic dialects which extend from Senegal to 
Afghanistan, or the Mumeng dialect chain in Morobe Province, 
Papua New Guinea. Analysis of the results of intelligibility 
testing divides thome chaine into optimal chunks for the most 
affective coverage. 


Use of Resources. Intwlligibility surveys are the most efficient 
uas of personnel, time, and other resources, both for surveys and 
= a basis for carrying out language programs. Some surveys have 
merely repeated earlier surveys by gathering longer word lists or 
by gathering word lists in additional places, but have still come 
out with an incenplate picture of the situation which could only 
then be clarified by testing intelligibility. 


Objections to Intelligibility Testing 


There ha 
testing beside 


been various objections made to intelligibility 
those discussed by Casad (1974.57-61) and Simons 
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11979. 13-16). Some of these are that it takes too tong, it is 
too much Work, it is too expensive because it requires two trips 
to @ach tasting point, it im too subjective, there are too many 
variabl shortcuts can be taken, and the ends can be 
accomplished by other means. 


In reply, it needs to be repeated that all other known 
methods try to gat at intelligibility by measuring something else 
whose relationship to intelligibility is variable from dialect 
pair to dialect pair. The time, effort, and expense is less in 
the long run than carrying out unnecessary language programs. 


Shortcuta have invariabley led to inconclusive results. 
either because some dialacts were left out which should have been 
tasted, or because the methods did mot distinguish other 
abilities from intelligibility, such as reading, writing, 
translation ability, or second language ability. 


There are variables in any kind of testing or measuring, and 
thus it is important to eliminate as many variables as possible 
in order to reduce the margin of error. It is important to be as 
consistent and thorough as possible, possibly reducing the nusber 
of dialects to be evaluated in the time available if necessary. 
The final proof of any survey is in the final outcome; does the 
literature in one dialect really reach adequately all the other 
dialacts the survey predicted it would reach? Survey evaluations 
need to be ongoing as literature is produced and literacy and 
distribution programs are carried out. [t may be that additional 
programs will bw needed in dialects that were originally thought 
to be closer than they really are, or that other dialects can be 
covered by a single program that were originally thought to be 
more distinct than they really are. 


In general, intelligibility testing has turned out to be 
fairly accurate. The main problems have come from setting the 
acceptable threshold of intelligibility too low (70% in Mexico) 
and not recognizing marginal intelligibility, from not screening 
out bilinguals from among the testees, from not reporting the 
standard deviation of scores and the number of subjects tested, 
from adjusting raw scores, and from not recognizing the 
importance of language ettitudes that hindered materials from 
being accepted which, were adequately understood (B. F. Grimes 
1982.32, 1984, 19850). 

` 


More on Intelligibility Testing 


Some additional understanding of intelligibility testing has 
been gained in recent years through surveys and the problems 
mentioned abovi Other aspects of intelligibility testing need 
further emphasis because they have tended to be neglected. 


Sample of Testees. Because testeen have often not been 
adequately screened to weed out those that have had contact with 
the dialect to be tested, the sample tested has often included 
individuals who have learned some of the related dialect or 
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One intelligibility test found ‘intelligibility’ between 
related varieties was 95% in one direction, but in the other 
direction was 25% to 38%, depending on the dialect tested. It 
wan concluded that these ware dialects of a single language, 
because 95% intelligibility indicates adequate understanding. 
Comparative studies which had been done previously were 
overlooked, although they indicated the two were not dialects, 
but separate languages. The 95% understanding was from 
bilingualism in the sample of speakers tested, and not from 
inherent intelligibility. 


In another survey, lexical similarity comparisons were not 
done, but 75% ‘intelligibility’ was reported for two languages 
that were in different linguistic groups too distantly related to 
have that much intelligibility. In addition, the 
‘intelligibility’ was with a major lingua franca of the region, 
which should be considered an important clue to possible 
bilingualism. 


In still another survey, some individuals from one town 
tested fairly high on one dialect, wher others from the sane 
town tested quite low on that dialect. Because the dialect bring 
tested was the market center for the ari it is now clear that 
the higher scores were from individuals who knew the market 
dialect. Therefore, inherent intelligibility ia really lower 
than the mean indicated. 


Iransletion and Use of a Second Language. A test method that 


Involves translating rather than answering questions in the 
vernacular (Simons 1979.9-1) is not able to distinguish between 
a person's ability to translate into a second language and his 
understanding of another dialect, which are two separate language 
skills on which individuals vary widely (B. F. Grimes 1966b.22). 
Even though this use of a second language enables an investigator 
to do intelligibility testing sooner than if he waited until he 
knew one of the dialects, and it requires less time and affort in 
setting up the tests, the results are lass reliable because of 
the added variables of translation ability and knowledge of and 
interference from a second language. This Kind of testing limits 
the sample of testeas to people who are bilingual. In a case 
where the second language is related to the dialects being 
tested, it is possible that bilinguals may have higher 
comprehension of another dialect even if they have not had 
contact with it before, because the knowledge of a related 
language may enhance their understanding of a new dialect 
(Sankoff 1968.184, 1969.848, Simons 1979. 114). 


Group Testing, In general, testing a group together is I 
reliable than testing individuals, for several reasons. If 
bilinguals have not been screened out, then those bilinguals will 
understand more than others in the group, and they may give the 
impression that inherent intelligibility iw greater than it 
really is. In face-saving societies, or societies where 
individuals tend to not disagree with their leaders, it may be 
difficult to determine if the spokesman for the group really 
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reflects group understanding. For most parts of the world 
homogeneity in bilingual proficiency cannot be ansumed; mhathar 
or not there is homogeneity can only be found out by sampling 
different age, sex, educational, and other relevant groups ir. 
each region. Even if bilinguals are eliminated from the sample 
thera are other uncontrolled variables in group testing. It is 
important to find out what individual understanding is, because 
in veal life the individual will usually not have the benefit of 
group explanations of meanings. The “highest group potential ~ 
(Simons 1979.15) may not be a realistic or useful indicator of 
intelligibility. 


Conclusions 


The testing methods described in Casaa (1974) are the best 
general mode) for arriving at valad results for dialect 
intelligibility investigations in a variety of situations. Otnar 
methods and shortcuts leave too many unanswered questions, anc 
need to be verified for each pair of dialects separately by 
calibrating them against intelligibility testing anyway. 
Experience has demonstrated the importance of screening testees 
to eliminate those with bilingual proficiency in the dialects to 
be tested; of watching for marginal intelligibilitys of reporting 
standard deviation, mean, and number of persons tested at eacn 
place; of not adjusting raw scores but rather eliminating faulty 
questions; and of making use of linguistic knowledge of the area 
based on the comparative method. The time and effort spent in 
carrying out these procedures is worth the resulting accurate 
picture of the situation that is gained. 


APPENDIX 


COGNATE RELATIONSHIPS OF SOME EUROPEAN LANGUAGES 


Spanish-Portuguese -89 
Italian-French 89 
Italian-Catalan -87 
Catalan-Portugue: «85 
Sardinian-Italian -85 
Italian-Portuguese -85 
Spanish-Catalan 85 
Italian-Spanish +82 
Sardinian-French 80 
Rheto-Romance-Italian +78 
Aneto-Ramance-French 78 
Sardinian-Port uguese -78 
Rumanian-Italian -77 
Rheto-Romance-Catalan 276 
Sardinian-Spanish -76 
Sardinian-Catalan 7S 
French-Portuguese 7 
Rumanian-French 278 
French-Spaniah ‚T 
Sardinian-Rumanian -74 
Rheto-Romance-5pani sh a74 
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